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TIIK  nCCUHIU'lNCK  OK  FINO.KKPHINT  CilAPACTFIU OTICS 
AS  A TWO-DIMKNSIONAI,  POISSON  PROCKSS 


Stanley  I..  Sc  Jove 

University  of  Illinois  at  Chicago  Circle 

Keu  Wovdr,  X P'unjitrn:  j'intjn'pvintr^;  iJcntificaticu;  cyiyninal  istios; 
tix>-dif^cnnionaL  stivluistw  pi'ooepo;  Mavkov  procssa;  Poisson 
pt’ocrus. 

APiViHAL’T 

Tt>e  individual  il,y  of  a f ingci'pri nt  is  based  on  t.t>e  configuration 
of  occurrences  of  the  ten  Oalton  characteristics.  Tlie  existing  model 
for  tlie  occurrence  of  ttiese  cliaj'acteristics,  in  tenns  of  a grid  of 
cells,  is  further  developed.  The  occurrence  of  the  characteristics 
is  mcxlelleil  ns  a two-dimensional  multivariate  Poisson  process. 


1.  INTIUAIPCTION 

The  indlvidiial ity  of  a fingerprint  is  based  on  the  pattern 
of  occurrence  of  the  ridge-line  det.ails.  These  details,  first 
systematically  studied  by  Galton  (lfl92),  are  now  called  Galton 
characteristics.  They  are  of  ten  types:  islands,  bridges,  spurs, 
dots,  ridge  endings,  forks  (bifurcations),  lakes,  trifurcations, 
double  bifurcations,  and  deltas.  (See  Osterburg,  Parthasoi-athy , 
Raghavan  and  Gclove  (19TT)  for  diagrajns  and  detailed  descriptions 
of  the  characteristics.  ] It  is  desired  to  develop  formulas 
for  the  probability  of  partial  prints,  such  ns  crime-scene  prints. 
In  Osi.erburg,  Partlmsnrathy , Raghavan  and  Gclove  (1977)  such  a 
development  was  made,  according  to  the  J'ollowittg  mtxlol. 
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Assumptioti  1.  A f ingerprinL  is  considered  in  terms  of  u grid 
of  one  millimeter  cells. 

Assumption  2.  For  each  cell  there  are  13  possibilities:  either 
the  cell  is  empty,  or  one  of  the  following  12  possibilities  has 
occurred:  island,  bridge,  spur,  dot,  ending  ridge,  fork,  lake, 

trifurcation,  double  bifurcation,  broken  ridge  (two  ridge  endings), 
or  some  other  multiple  occurj’ence. 

Assumption  3.  There  is  statistical  independence  between  cells. 

Assumptions  2 and  3 beai-  further  study,  with  a view  toward 
developing  better  models. 

In  Sclove  (197^))  tlie  extent  of  departure  from  Assumption  3 
was  studied,  and  the  multinomial  model  was  refined  according 
to  that  dopartiire. 

The  categories  defined  in  Assumption  2 ai'e  somewhat  arbitrary. 

The  ton  categories  corresponding  to  the  occurrence  of  each  of  the 
ten  ciiaracteristics  as  singletons  are  natural  enougli;  it  is  the 
special  treatjnent  accorded  two  ridge  endings  and  the  lumping 
together  of  all  other  multiple  occurrences  wiiicli  w.arrant  alternative 
treatmctit.  In  tlie  present  paper  a mettiod  is  presented  whicti  not  only 
takes  account  of  inter -cell  dependence  but  also  provides  a different 
treatment  of  multiple  occurrences. 

In  the  present  paper  the  occurrence  of  fingerprint  characteristics 
is  modelled  as  a two-dimensional  Poisson  process,  taking  into  account 
the  dependence  between  cells  and  providing  alternative  treatment 
of  multiple  occurrences. 

A configuration  such  as  that  of  Figure  1 represents  the  result 
of  placij:g  a grid  of  one-millimeter  squares  over  a partial  fingerprint. 
It  has  1*3  cells,  37  of  them  empty,  the  ot.her  six  being  occupied  by 
•*  ridge  endings  and  2 forks.  Its  probability,  estimated  by  the  method 
of  Osterburg,  Parth.asarathy , Raghavan  and  Pclove  (1977)  is  given  by 
-logj^QP  = 11. It,  where  P denotes  the  probability  and  P its  estimate. 
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Coiif  iRUfat.  ion  of  If?  coll;',  with  !i  enclitip  fidRon  and  ? forks. 
0 = ('mpty  coll,  K = ondiiiR  ridRo,  F = fcii'k. 


MOnKLL.lNC.  DKi'FNl.'KNGi:  AtiONC.  CEl.Li 


LoL  the  cells  be  munbereJ  In  some  fixed  order,  say,  as  one  reads 
a laiiRuaRe  suci>  as  FiiRlisli,  starting  wLtii  ttie  top  row  and  moviiiR 
from  left  to  rigiit  witliin  eacli  row. 
giving  tiie  outcome  in  tlie  c-tli  cell, 

Y, 


hot  Y be  a random  vector 
c 


Y = (Y 
c 


lc’‘?c 10, c'’ 


c = 1,0, ...,t  = t.otal  number  of  cells  in  tlie  print.,  where,  for 


V = 1, ?,...,  10  characteristics,  Y^^^  = number  of  occurrences  of  the 


v-th  characteristic  in  the  c-th  cell. 

Let  r denot.e  the  j'robabi li t.y  of  a configuration.  Then 
r = r(configuration)  = Yj=yj  , Y^'y^*  • • • »Y^=y^  ) . 

This  can  be  written  as 

r = P(Yj=y^)P(Y^=y^lY^)r{Y^=y^lY,,Y^)---r(Y^=yjY^^_^ Y^). 

Under  Assumption  3,  independence  lunonp  ceils,  expression  (2.1) 
simplifies  to 

r = r(Y^=y^)r(Y^=y^)--'r(Y^=yj^). 

Under  this  assumption,  all  tli.at  would  be  reiinired  to  obt.ain  a model 
to  give  r would  be  to  describe  the  wit.hin-cell  distributions  of  the 


(2.1) 


ten  characteristics,  i.e.,  the  Joint  distribution  of  Y^^ . Y^^ , . . . , 
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However,  any  departure  from  Assumption  3 means  tliat  we  nee^l  to  model 
not  only  what  happens  within  cells  but  also  to  model  the  dependence 
between  cells. 

As  a step  toward  mcdellinf'  dependence,  we  introduce 
Assumption  3!  The  outcome  in  the  c-th  cell  depends  on  the  outcomes  in 
the  other  cells  only  tlu'ouch  the  outcomes  in  adjacent  cells. 

Due  to  the  fact  that  (2.1)  forces  one  to  use  a linear  ordering 
of  the  cells,  one  must  develop  the  model  in  terms  of  the  four  preceding 
adjacent  cells  rather  than  all  eight  adjacent  cells.  More  precisely, 
under  Assumption  3’,  the  conditional  probability  P(Y  =y  |Y  ,,Y 

C C C -1  C — c 

...,Y^)  will  not  depend  iipon  all  of  Y^  only  upon 

foui‘  of  these  variables,  namely,  those  corresponding  to  the  cell  to 
the  left  (west)  of  cell  c,  the  cell  above  (north  of)  cell  c,  the 
cell  just  northwest  of  cell  c,  and  the  cell  just  northeast  of  cell  c. 
[If  the  configuration  were  rectangular  and  cells  were  indexed  as  (i,j), 
i = J=1,...,J,  then  the  conditional  distribution  of  the  vai'i- 

able  Y^  given  the  variables  preceding  it  in  the  ordering,  namely 

'^i,j-r’--’'^i,r^i-i,j’''i-i,.i-i’---’^i-i,i’---’^i’ 

upon  the  four  variables  Y.  , ,,Y.  , , Y .]  If  W is 

:,J-1’  1-1, J+1’  1-1, j,  i-],j-l  c 

a matrix  whose  columns  are  t.he  four  preceding  neiglibors  of  Y^,  then 
Assumption  3'  is 

" P<V^ol"c'  ■ 

Assumption  3'  may  be  viewed  as  an  assumption  that  the  process  is  a 
Markov  process. 

2.1.  Data  Analysis 

A between-cells  data  analysis  is  discussed  in  Sclove  (1970). 

The  results  were  that  the  probability  that  a cell  is  occupied  increases 
monotonically  with  the  number  of  its  neighbors  that  are  occupied. 
Accordingly,  we  introduce 

Assumption  U' . The  exi'ected  number  of  occurrences  in  a coll  depends 
upon  the  number  of  neigliboring  cells  depends  upon  the  number  of  neigh- 
boring cells  that  are  occupied. 

A within-cells  data  analysis  is  discussed  in  Appendix  A.  It 
was  found  that  negative  binomial  distributions  provided  a good  fit 
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to  Mie  liist  rit''.itioii  of  t tio  tmml'ir  iif  ctru’iic  t.(>!- i st  i cr , aii'l  to  the 
trinberr.  of  different  oir\r;ief  er  i ;•(  i cr> . ’Ihir>  is  cotir.istent  witti  a 
model  of  a mixture  of  I’oisr.on  d i nt»  i hut  ioiin,  for  a iu>vitive  binomial 
d i st  I'i  but  ion  can  be  obtained  an  a mixture  of  ToisKon  d i r.tr  ibut  i onr. . 
AcGor.iin^Tly,  we  rdiall  make 

Ar.r.uwnt.  ion  !i ' ' . 'I'tii-  nu’iibej-  of  occuri'ei\cer.  in  a cell  is  a I'oir.r.on 
rairl on  variable. 

Aasumpt  ions  !<’  an  i li ' ' tv'ml)in('  into 

Assumption  !i . The  numt'er  of  o('cui'reiU'es  in  a cell  is  distributed 

.accordin*;  t(’  a I’oisson  d i st  I'i  but  iot\  with  par;uneter  \ which  depends 

upon  the  rand  ni  variable  a,  the  nunbt'i'  of  adjacent  cells  which 

aie  oceupit^i:  V = \(a),  a ” f*,!,.',',  c'r  . 

Thus,  lett  inr,  N be  the  i\uniber  of  occur i‘ oncer,  in  eell  c,  we 
c 

have 

r(N  =n(w  ) = I'tN  =n(A  =a)  = . 

c ' e c c n ! 

Mow,  this  is  the  disti'ibut  ion  of  the  tu' ta  1 number  of  c'ccurrenees  of 

all  ten  ty]'es  in  a cell.  The  I'olative  fretpienc  ice.  p^^  of  the  types, 

V = 1 , n,  . . . , 10  typtu;,  are  );iven  in  Table  1.  It  is  reasonable  to 

assiune  that  the  expected  v.alue  of  the  nuii'.l)er  of  occur I'ences  1 

ve 

of  the  v-th  charact.er  i st  ic  in  the  c-th  ct'll,  p.iven  A =u , ir.  \(a)p  , 

C V 


V = 

1.2.. 

..,10.  We 

!'.h  1 11  a 

^le' 

• • ''‘10,.'’  ' 

0 

PpP 

'0  • • • • 

« 

.r,,V  This 

i V oi; 

Assumi't  ion 


" w V ' ‘'i  ’'2  •••''10 

' -Me -'10,0  ^ 

Assumptions  It  ;u>d  S combined  p. ive  the  result,  that,  the  marginal 

Joint  distriliut.ion  of  Y,  ,Y„  Y,„  (marginal  in  the  sense  of 

Ic  ,r  U>,c 

aver.aging  over  the  distribution  of  1 is  that  of  independent  Poisson 

viu'iables,  the  pariunoler  of  Y being  \(a  )p  : 

VC  e v 

p(y  =v  |w  =w  ) = r(Y  =y  |a  =a ) = I!  ^ . 

c c c e e c c 
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Relative  Frequcncien 

TAlUl',  1 

of  the  Ten  Gal  ton 

Character i sties 

Character  is  tic 

Fretiuency* 

Relative 
Frequency ' 

Ending  ridge 

irliT 

.l»9T 

Fork 

300 

.159 

I'elta 

3li 

.0135 

I'pur 

B8 

.0350 

Dot 

28'’’ 

.10.7 

Island  ridge 

o‘.)Q 

. 103 

l\ve  (lake) 

00 

. 0263 

Double  bifurcation 

lO 

.00637 

Tr  i furcation 

7 

.00279 

Bridge 

ihO 

.0558 

2‘31 1 

1 . 00 

Pfisod  on  flS91  colls 


Border  Cells 


Cells  at  the  border,  not  beiiiR  touched  by  tlu'  full  complement 
of  four  preceding  adjacent  cells,  recp.iire  some  special  t.reatment. 

One  could  take  the  results  in  bordei’  cells  as  given  and  take  tdie 
probabilities  for  t.he  ottier  cells  conditionally  on  t lie  outcomes  in 
the  border  cells,  but  this  would  result  in  considerable  reduction 
in  the  effective  sample  siso.  (E.g.,  lO  of  the  1*3  cells  in  Figiu'e  1 
are  border  cells.)  Wo  wish  to  use  such  information  as  is  present. 

I.e.,  stime  but  not  all  It  preceding  adjacent  cells  are  sometimes 
present;  we  make. use  of  this.  For  example,  if  there  are  3 adjacent 
cells  and  2 are  occupied,  then  we  know  that  if  all  li  cells  were  present, 
then  2,  3 oi'  It  of  them  would  be  occupied.  To  use  such  information 
we  need  to  know  the  conditional  expected  value  of  the  number  of 
occurrences,  given  that  the  number  of  adjacencies.  A,  is  in  the 


r 


interval  lnj,a^l,  where  Hccall  that  Is  the  tnimber  of 


occurrences  in  coll  c tind  A in  the  number  of  the  four  preceding 


adjacent  cella  that  are  occupied.  Then  the  cotulitlonal  expected 


value  we  need  is  E[ N |a,<A  <n^)  = l(a, ,n^),  say.  Note  that 

c 1—  c—  2 12 


X(a,  ,a^)  = X(n)F(A  =a)/5:‘-  F(A  =n). 
1 r c _ e 


n=a. 


n=n. 
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K:'. t imato:'.  of  I'oiul i t i oiial  V'xpooto.I  Nur.l'or  of  Oi’cufrrncos 
for  I’orvlor  Colls,  V(aj^,a,l 
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CCMI'UTATICN  OF  I'KOBABl  l.l  I'l  FO  OK 

VAHIOUC 

CONKUIUKATIONC' 

We 

‘ nrt'  !iov  in  a jv; 

lit  ion  to  ti:io  Mt 

e mo.lol 

to  os  timate  flio 

I'rotvil'i  I i t ios  v.'f  varioiii 

oonf  i(;\n  at  ions 

. Wo  ill  list  rate  vith  the 

oonf  ipurat  ion  of  Kigiiro 

1.  Tal'lo  IV  il  hist  rat 

os  the  computation 

It  mn.v  t't'  tu'lpl'ul  to  I'liu' ivt.'it  t*  t !ii’  stop::  Iti  the  oivMt'ut  nt  i on , sisK'o 
ot  hor  loiu'.’uo  m;\y  wi::li  to  ccip!  >y  :i  t u’o- | imons  i cti.i  1 

Vo  iro'.on-mvi  1 1 i luri  ial  Markov  iiualol  in  I'la'l’l  I'm::  Ihoy  :u'<'  workiiv;  on. 

'I’tu'  stops  ai'i'  ns  TolU'w;:. 

(l^  I'oiiiit  [)<••  imrjl't'i-  v'l’  'Hi.'aooiu' i I'S  I'o!'  ('aoh  ooll  . 
i.'^  U'ok  np  ‘ho  oovrt'oi'oii  i i np.  oxpootovl  numbor  o'!'  ooovnionoo;;  in 
I'aMo  111. 

(O  pivnpiito  the  roi::;:oi\  probability  of  thi::  luimbor  of  ooenr  renoos ; 
ill)  TJoto  ttio  typo;:  v'f  ohar  ao  t or  i ;:  t io  .s  in  the  eell,  if  any. 

( S ) (.’ompute  t ht'  mn  1 1 i luii:  i a I prol':it'i  1 i ty  of  this  oonf  ipnrat  ion  cf 
typo:',  in  '\  Ooll. 

it')  Multiply  t ht-  ro'.ail'::  of  s'-top;:  lO  aii'i  i ^ to  ot'tain  the  proba- 
bility for  t b.o  oi'l  1 . 

it)  Multiply  the  rer.nlt;:  for  ;tll  oells  to  obtain  tlio  probability 
for  the  whole  I'onf  ipuiait  ion. 

'I'he  result,  whiolt  will  bt'  a vi-ry  ::’t\a  1 1 probability,  i r.  best  exprosse.i 
on  a ni'pat  ive  lop,  r.oaU'.  [This  moatts  that  in  stop  if)  we  ahd  the 
lops  of  the  nnmbers  ratlior  th:in  mnitiply  (ho  nnmt'ers  ( ht^mso  ivi's . I 

Tills  lioais  with  t ho  ass  iptu'-.ont  of  a prol'ability  1'  to  the  ooour- 
renoe  of  a piven  o.'nf ip  irat  ion  in  a pivt  n s.ot  I'f  oells.  For  inferen- 
tial purposes  it  i nooo;:sary  to  o;:ti!iiate  the  probability  t liat  a 
person  has.  this  oonf  ipnrat  ion  ^oriowhta'y.'  oji  't  t fi  niters.  This  problem 
is  disetisse,!  in  ttootion  of  Osterbnrp,  I'art  hasiira  t hy , Haphavan 
and  IVlove  (lOff).  Further  problems  of  inferenoe  oonoerninp  the 
identity  of  the  pt'rson  who  loft  a print  are  considered  in  boction  It 
of  t he  s.'une  p:ipor . 

Now  we  o’lrry  out  the  above  st  t'ps  for  the  oonf  ipurat  ion  of 
Fipure  I ITable  IV ). 
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I 

I 


I 


I 


[ 

I 


1 


1 

1 

I 

1 
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TAm.K  IV 

Computation  of  Kstinuitc  of  Probability  of  Conf in'irntion  of  Figure  J 


‘r"2 

Number  i’olsson  prob.  of 
in  tills  tmmbor 

1 cel ] i n CO  1 I 

Types 
In  cell 

Multinomial  i 

prob.  of 

t.liesc 

types  Cell  probability 

Number 

of 

sue  h 
cells 

0,C 

0.222 

0 

C-^'--''=:.0Ol 

0 

1 

.801 

6 

0,0 

0.222 

1 

0.222c"^  178 

F 

. It  97 

. 178(  .1(97)=.  188 

3 

0,0 

0.222 

1 

0.222c  JYrt 

F 

. 159 

. 178( .159)  = .028 

1 

0,1 

0.277 

0 

c-"-"^^.758 

0 

1 

.750 

2 

0,2 

0.105 

.0 

e-«-’«'^=.737 

0 

1 

.737 

It 

0,2 

0.^05 

0 

0.  J05c"'^‘  ^^'^=.225 

!•: 

. >197 

.?25(.>i97)  = .112 

1 

0,  1 

0.321 

0 

c-‘’-^'’^..72l. 

0 

1 

.72I4 

5 

0,li 

0.  nf 

0 

.-^’•^^■^.71)1 

0 

1 

.71I1 

1 

1,1 

o.:iii5 

0 

e-"-'''^^=.708 

0 

1. 

.708 

13 

1,1 

0.3>''3 

1 

0.3>t5c"'’’^''^=.2Mi 

F 

.159 

.2'ilt(  .159)  = .0,19 

1 

1,2 

0.369 

0 

-6-369 

0 =.091 

0 

1 

.691 

It 

1,3 

0. 389 

0 

c-“' 

0 

1 

.678 

1 

2,2 

0.  Ii02 

0 

0 

1 

.669 

_1^ 

-logjo^’  = 

-6  1 og  j p . 80 1 - 3 1 og  j . 088 

- ... 

- logj^.669  = 32.9 

It  3 

Tlio  or.timatoii  negative  log  probability  in  11?. 9*  Conijuire  this  with 
the  figure  of  11. li  given  [Ontorbtirg,  rartbararatby , Hagbavan  anti  Gclove 
(19TT)1  by  the  appi'oximation  l)ar.e(l  on  intieperuienco;  tbc  ratio  of 
probaiiilitics  is  about.  3/?:i..  'I’liis  difforcnce  is  unimportant  since 
we  are  interest. cd  only  in  order  of  mag.nit.U'ie . 

As  a second  extunple,  consider  tbc  conf igural.i on  of  Figure  2, 
in  which  the  occurrences  are  still  all  singletons  but  are  tightly 
clustered.  The  negat.  ive  log  pi'obabillty , estimatotl  by  the  method 
of  this  paper,  is  11.2,  comimred  to  .11. for  the  met. hod  based  on 
indejicndencc.  'I'be  ratio  of  tliese  t.wo  probability  estimates  is  1.6j1. 
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n 

b 

c 

<i 

e 

f 

R 

h 


1 

(1 

0 

0 

0 

0 

0 

0 


0 

0 

0 

0 

0 

0 

n 


(1 

V 

V. 

0 

0 

0 

(1 

1) 


ll 

(1 

K 

V 

V, 

0 

(1 

0 

no. 


0 

K 

o 

0 

0 

0 

0 


r. 

0 

0 

0 

o 

0 

0 

0 


Ooiificurntion  c'f  In  cm',!!'-.  with  li  ondinr.  riilf’cs  mivI  f('rk! 
!is  in  KIr.  1,  bill,  lu'i'o  t ho  six  occurrences  are  clur-l  ercd. 


V 


I 


1 

I 

I 

I 


/\s  a iiiird  ex.Tjiiple,  lei.  )is  sbill  consivier  a conf i^ural  ion 
of  'n  celJs  wit.l)  It  ending,  ridRos  and  ?.  forks,  as  in  KiRures  1 and 


2, 

but  now  suppo 

GC  Gomo 

of 

tlieso 

occ'ur 

not,  as  s i IV’.  1 ot.ons  but.  ratlier 

as 

mill  tiple  occurrences 

in 

cells 

(I'M  pure  1), 

1 

«. 

1 

It 

'S 

6 

a 

(1 

0 

0 

0 

0 

0 

b 

(1 

0 

!•’ 

f;I'' 

(’1 

0 

c 

0 

0 

0 

v.v.v. 

0 

0 

d 

0 

0 

0 

0 

0 

0 

c 

0 

0 

0 

0 

0 

0 

f 

(■» 

0 

0 

0 

0 

0 

R 

0 

' 0 

0 

0 

0 

0 

h 

0 

FIO. 

1 

Conf iRiiration  of 
ripni-es  ] and  P, 
a doublet. on,  .'Uid 

l|  ,1  cell  s 
but  witti 
a triplet 

wltli  •• 
t lie  six 

endinp  ridpea  and  P forks,  as  in 
occurrinicos  as  a sinpleton. 

The  new  feature  i 

of  tills 

ex.ampJe  is  t.he 

multiple  oeeurronces  in  ceils 

(b,l|)  and  (c,li).  FinditiR  t tie  probability  of  l.tie  doubleton  in  coll 
(b,li)  involves  tlic  foltowiriR  comput.ati on . 
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Number  of  adjacencies  = 1 

Expected  number  of  occurrences  = 0.3^5 


Poisson  probability  _o.3l45  o 

of  2 occurrences  = e ‘ - 'gj — 


= U. 21  X 10 


-2 


Multinomial  probability  pin  n 

that  the  2 occurrences  = ( ) .li9T^.159'*’* 0135^' *•  .0558 

are  E and  F 1,1,0,. .. ,U 

= 7.85  X 10"'^ 

Prob.  for  cell  (b,U)  = (7.85  x 10"^)(1|.21  x io~^)  = 3.31x10"^ 


The  computation  of  the  probability  of  the  triplet  EEE  in  cell  (c,U) 
is  as  follows. 


Number  of  adjacencies  = 2 

Expected  number  of  occurrences  = 0.1402 

Poisson  probability  0.1402^  , -3 

of  3 occurrences  = e — — = 7.214  x lo 


Multinomial  probability 

that  the  3 occurrences  = « 

are  E,E,E 


q).  1497^.  159°*  ••.0558° 


= .123 

Prob.  for  cell  (c,l4)  = .123(7.214  x lo"^)  = 8.89  x 10 


Computing  the  probabilities  for  all  the  cells  and  multiplying  gives 
-log^pP  = 11.7  for  this  configuration.  The  method  given  in  Osterburg, 
ParbhasEirathy , Raghavan  and  Sclove  (1977)  lumps  all  multiple  occurrences 
together,  assigning  a cell  with  amy  such  multiple  occurrence  a probabil- 
ity of  .0355.  A fork  has  a probability  of  .0382,  an  empty  cell,  a 

'*■  ho  PI 

probability  of  .766;  hence  P = .766  .0355  .0382  , -logj^pP  = 9.0. 

The  difference  is  negative  log  probabilities  provided  by  the  two 

2.7 

methods  is  11.7  - 9.0  = 2.7;  10  = 500.  The  difference  begins 

to  be  sizable  but  is  not  particularly  important  in  terms  of  order 
of  magnitude,  not  when  one  is  more  interested  simply  in  discriminating 
between  probabilities  on  the  order  of  one  in  a thousand  and  those 
on  the  order  of  one  in  a million  millions. 
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As  a fourth  example,  let  us  consider  again  a total  of  I43  cells 

with  6 occurrences,  two  forks  and  four  ridge  endings,  but  all  in 

one  cell.  This  is  to  be  ccasidcrod  as  a mathematical  extreme  and 

not  an  empirical  example:  We  never  observed  more  than  5 occurrences 

in  a single  cell  and  these  were  combinations  of  ridge  endings  and 

dots.  The  conditional  expected  number  of  occvirrenees  in  a single 

cell,  with  no  aeijacent  <'ella  occupied,  is  O.PP?;  the  corresponding 

—0  222  ■ 6 -7 

Poisson  probability  of  6 occurrences  is  e ’ 0.222  /6l  = 1.33 '<10 

Obtaining  the  relative  frequencies  from  Table  I,  we  compute  the  multi- 
nomial probability  of  2 forks  and  U ridge  endings,  given  6 occurrences, 
as 


(,j  p 159". 0135^’- ••.0558^  = 15(.>t9T)''(.159)‘ 


= 2.31  X lo"^. 

The  probability  for  the  cell  is  thus  (1.33  x 10  ^)(2.31  x 10 
= 3.08  X 10  . Computing  the  probabilities  for  the  other  ceils  by 

following  the  steps  outlined  and  combining  the  results  for  all  cells 
gives  -log^^P  = 13. for  this  configuration.  The  method  given  in 
Osterburg,  Parthasarathy , Raghavati  and  Sclove  (1977)  gives 
= 6.32  for  this  configuration.  That  method  gives  such  a high  probabil- 
ity compared  to  the  method  of  the  present  paper  because  that  method 
lumps  all  multiple  occurrences  together,  as  mentioned  above,  assigning 
a cell  with  any  sucli  multiple  occurrence  a probability  of  .0355, 
or  a weight  of  I.I45  = -logj^p.0355.  Using  that  metliod,  one  would 
assign  that  probability  to  the  cell  containitig  the  multiple  occurrence 
EEEEFF'.  The  method  of  the  present  paper  gives  a probability  of 
3.08  X 10  i.e.,  a weight  of  8.51  = -log^l^P  to  this  multiple 

occurrence.  This  underscores  the  advance  made  by  the  present  method, 
compared  to  the  method  in  Osterburg,  Parthasarathy,  Raghavan  and 
Sclove  (1977)-  A criticism  of  the  present  method  is  that  it  is 
too  dependent  upon  the  I'lacement  of  the  grid  of  cells,  which  is 
arbitrary,  in  tliat  multiple  occuri-ences  in  a single  cell  could  liave 
been  placed  in  two  cells  if  the  grid  liad  been  placed  differently. 
However,  the  effect  of  this  appears  not  to  be  lai'ge.  For  example. 
Figures  h(n)  and  (b)  show  portions  of  two  configurations,  both  with 
•4  occurrences.  In  Fig.  l4(n)  the  I4  occurrences  are  all  in  one  cell; 


T ^ 

r 
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in  Fig.  U(b)  they  are  in  two  ceUn.  The  Poisson  probability  for 
the  12  cells  of  Fig.  It(a)  is  ( e"°’ . 222e"°‘^^^)^(e"°' 
giving  -log  P = 2.02.  The  Poisson  probability  for  the  12  cells 
of  Fi«.  1,(1.)  ia  (e-°•""^)5(.^22e-»■22^)l(,-''•3l*5,3(„.3^5^-0.3l,5)l 
_n  iioo  o 

P = 2.61*.  The  difference  is  2.6h  - 2.02  = 0.62; 

0.62 


10 


10 


= *>.2,  not  much  difference  at  all, 


0 0 0 0 

0 xxxx  0 0 

0 0 0 0 


0 0 0 0 

0 XX  XX  0 

0 0 0 0 


FIG.  lt(a)  FIG.  l»(b) 

Portions  of  two  configur.ations.  (a):  occurrences  in  a single  cell, 

(b):  *4  occurrences,  2 in  one  cell  and  2 in  the  next.  (The  symbol 

X indicates  an  occurrence  of  some  type. ) 


APPENDIX:  MARGINAL  DISTRIBUTION  OF  THE  CHARACTERISTICS 

The  distribution  of  the  number  of  occurrences  per  cell  is  given  in 
Table  V. 


TABLE  V 

Distribution  of  Number  of  Occurrences. 


Number  of 


occurrences 

0 

1 

2 

3 

I4 

5 

Total 

Number  of 
cells 

65814 

159l< 

320 

72 

19 

2 

8591 

Proportion 
of  cells 

.766 

.185 

.0372 

. 00838 

.0022 

. 00023 

1.00 

This  distribution  is  well  fit  by  a negative  binomial  distribution. 

A Poisson  or  gecmietric  distribution  is  not  adequate.  An  interpreta- 
tion of  this  finding  is  that  what  is  involved  is  a gamma-type  mixture 
of  Poisson  distributions  [see,  e.g. , Parzen(l962) , p.  57],  resulting 
in  a negative  bincxnial  distribution,  as  in  the  classical  accident 
studies  of  Greenwood  and  Yule  (1920).  We  remark  that  we  proceeded 
with  parameter  estimation  and  the  fitting  of  distributions  as  if 


we  had  strict  betwcen-cell  indepcndonce  thouRli  we  have  shown  that 
this  does  not  hold  strictly.  Wc  do  note,  however,  that  our  paraneter 
estimates  are  based  upon  (at  most)  the  first  two  sample  moments 
and  that  the  sample  moment  is  unbiased  for  the  corresponding  population 
moment,, even  witliout  independence.  The  chi-square  p,oodness-of-f it 
tests,  on  the  other  hand,  are  baseci  on  an  assumption  of  independent 
trials. 

Empirical  support  for  the  assumption  of  a local  Poisson  process, 

i.e.,  a Poisson  process  where  the  intensity  parameter  varies  (a  non- 

homogeneous  Poisson  process)  is  given,  then,  by  the  fact  that  a 

negative  binomial  distribution  fits.  Theoretical  support  for  the 

assumption  is  demonstrated  by  the  plausibility  of  the  following 

assumptions.  (These  are  the  usual  aximoms  for  a Poisson  process 

(see,  e.g.,  Parzen  (1962),  p.  119),  generalized  to  two  dimensions.] 

Given  any  set  in  the  (x,y)-plane,  Let  N(S)  be  the  number  of  occurrences 

in  S and  let  a(s)  be  the  area  of  S.  Given  any  point  (x,y),  let 

{S  } be  a sequence  of  sets  tending  to  (x,y):  lim  G = ((x,y)}. 
n n n 

Then  the  followit\g  assiunptions  are  plausible.  There  is  a positive 
number  A(x,y)  such  that 


and 


lim 


n-Mo 


lim 


n-vit' 


lim 


n-Kx) 


1 - P(N(S  )=0] 
n 


r[N(s  ) = 1] 
n 

a(s  ) 

n 

P|N(S  )>2] 


0 . 


A(x,y) , 
A(x,y) , 


ACKNOWLEDGEMENTG 

The  initial  investigation,  reported  in  Osterburg,  Farthasarathy, 
Raghavan  and  Sclove  (l977),  was  supported  under  a contract  with  the 
Center  for  Research  in  Criminal  Justice,  University  of  Illinois  at 
Chicago  Circle.  Work  on  the  cun-ent  report,  was  supported  under 
Grant  AFOSR  77-3*t5^  from  the  Air  Force  Office  of  Scientific  Research. 
The  author  gratefully  acknowledges  these  sources  of  support. 


_ll,_ 


BIDLUKIKAPHY 


Galton,  F.(l89^').  KiriKer  rrint-.'.  London:  Macmillan;  republication 
(1965),  New  York:  DaCapo  Press. 

Greenwood,  M.  & Yule,  G.U.(1920).  An  inquiry  into  the  nature  of 

frequency-distributions  representative  of  multiple  happenings 
with  particular  reference  to  tlic  occurrence  of  multiple  attacks 
of  disease  or  of  repented  accidents.  Jour.  Royal  Statist.  Soc. 

255. 

Osterburg,  J.W.,  Parthnsarathy , Raghnvan,  T.K.S  & Gclove,  S.L. 
(1977).  Development  of  a matliematical  formula  for  the  calcu- 
lation of  fingerprint  probabilities  based  on  individual  charac- 
teristics. Jour.  Amer.  Statist.  Assoc. 72,  772-8. 

Parzen,  E.(l962).  Stochastic  Processes.  San  Francisco:  Holden-Day. 

Sclove,  S.L. (1978).  The  occurrence  of  fingerprint  characteristics 
as  a two-dimensional  process.  Technical  Report,  AFOSR  Grant 
77-3*i5**,  February  1'3,  1978. 


